Wrapper and Hybrid Feature Selection Methods Using Metaheuristic Algorithms for English Text Classification: A Systematic Review
نویسندگان
چکیده
Feature selection (FS) constitutes a series of processes used to decide which relevant features/attributes include and irrelevant features exclude for predictive modeling. It is crucial task that aids machine learning classifiers in reducing error rates, computation time, overfitting, improving classification accuracy. has demonstrated its efficacy myriads domains, ranging from use text (TC), mining, image recognition. While there are many traditional FS methods, recent research efforts have been devoted applying metaheuristic algorithms as techniques the TC task. However, few literature reviews concerning TC. Therefore, comprehensive overview was systematically studied by exploring available studies different improve This paper will contribute body existing knowledge answering four questions (RQs): 1) What approaches apply TC? 2) Does lead better accuracy than typical methods? 3) How effective modified, hybridized problems?, 4) gaps current their future directions? These RQs led study works on metaheuristic-based contributions, limitations. Hence, final list thirty-seven (37) related articles extracted investigated align with our generate new domain study. Most conducted papers focused addressing tandem based wrapper hybrid approaches. Future should focus using hybrid-based approach it intuitively handles complex optimization problems potentiality provide opportunities this rapidly developing field.
منابع مشابه
A Classification Method for E-mail Spam Using a Hybrid Approach for Feature Selection Optimization
Spam is an unwanted email that is harmful to communications around the world. Spam leads to a growing problem in a personal email, so it would be essential to detect it. Machine learning is very useful to solve this problem as it shows good results in order to learn all the requisite patterns for classification due to its adaptive existence. Nonetheless, in spam detection, there are a large num...
متن کاملHybrid feature selection for text classification
Feature selection is vital in the field of pattern classification due to accuracy and processing time considerations. The selection of proper features is of greater importance when the initial feature set is considerably large. Text classification is a typical example of this situation, where the size of the initial feature set may reach to hundreds or even thousands. There are numerous researc...
متن کاملHybrid Active Feature Selection For Text Classification
Clustering is the most common form of unsupervised learning.In clustering, it is the distribution and makeup of the data that will determine cluster membership. It needs representation of objects and similarity measure. which compares distribution of features between objects. For the high dimensionality, feature extraction and feature selection improves the performance of clustering algorithms....
متن کاملFeature Selection and Classification { a Probabilistic Wrapper
Feature selection is deened as a problem to nd a minimum set of M features for an inductive algorithm to achieve the highest predictive accuracy from the data described by the original N features where M N. A probabilistic wrapper model is proposed as another method besides the exhaustive search and the heuristic approach. The aim of this model is to avoid local minima and exhaustive search. Th...
متن کاملH-BwoaSvm: A Hybrid Model for Classification and Feature Selection of Mammography Screening Behavior Data
Breast cancer is one of the most common cancer in the world. Early detection of cancers cause significantly reduce in morbidity rate and treatment costs. Mammography is a known effective diagnosis method of breast cancer. A way for mammography screening behavior identification is women's awareness evaluation for participating in mammography screening programs. Todays, intelligence systems could...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE Access
سال: 2022
ISSN: ['2169-3536']
DOI: https://doi.org/10.1109/access.2022.3165814